Fixing the Domain and Range of Properties in Linked Data by Context Disambiguation

نویسندگان

  • Alberto Tonon
  • Michele Catasta
  • Gianluca Demartini
  • Philippe Cudré-Mauroux
چکیده

The amount of Linked Open Data available on the Web is rapidly growing. The quality of the provided data, however, is generally-speaking not fundamentally improving, hampering its wide-scale deployment for many real-world applications. A key data quality aspect for Linked Open Data can be expressed in terms of its adherence to an underlying welldefined schema or ontology, which serves both as a documentation for the end-users as well as a fixed reference for automated processing over the data. In this paper, we first report on an analysis of the schema adherence of domains and ranges for Linked Open Data. We then propose new techniques to improve the correctness of domains and ranges by i) identifying the cases in which a property is used in the data with several different semantics, and ii) resolving them by updating the underlying schema and/or by modifying the data without compromising its retro-compatibility. We experimentally show the validity of our methods through an empirical evaluation over DBpedia by creating expert judgements of the proposed fixes over a sample of the data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fractional Order Generalized Thermoelastic Functionally Graded Solid with Variable Material Properties

In this work, a new mathematical model of thermoelasticity theory has been considered in the context of a new consideration of heat conduction with fractional order theory. A functionally graded isotropic unbounded medium is considered subjected to a periodically varying heat source in the context of space-time non-local generalization of three-phase-lag thermoelastic model and Green-Naghdi mod...

متن کامل

بررسی نقش انواع بافتار هم‌نویسه‌ها در تعیین شباهت بین مدارک

Aim: Automatic information retrieval is based on the assumption that texts contain content or structural elements that can be used in word sense disambiguation and thereby improving the effectiveness of the results retrieved. Homographs are among the words requiring sense disambiguation. Depending on their roles and positions in texts, homograph contexts could be divided to different types, wit...

متن کامل

An Epistemological Study of the Verse Tathir

The verse Tathir is regarded as one of the central verses in proving the infallibility of the Prophet and his future generation`s properties that have long been debated among the Fariqain commentators. The Sunni commentators, without any reasonable proof and only based on the context and etymology of the word al-bait, the house, have viewed all members of Quraysh in the Prophet's House, and in ...

متن کامل

Domain-adapted named-entity linker using Linked Data

We present REDEN, a tool for graph-based Named Entity Linking that allows for the disambiguation of entities using domainspecific Linked Data sources and different configurations (e.g. context size). It takes TEI-annotated texts as input and outputs them enriched with external references (URIs). The possibility of customizing indexes built from various knowledge sources by defining temporal and...

متن کامل

Optimization of Conventional Stabilizers Parameter of Two Machine Power System Linked by SSSC Using CHSA Technique

This paper presents a method for damping of low frequency oscillations (LFO) in a power system. The powersystem contains static synchronous series compensators (SSSC) which using a chaotic harmony searchalgorithm (CHSA), optimizes the lead-lag damping stabilizer. In fact, the main target of this paper isoptimization of selected gains with the time domain-based objective function, which is solve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015